Overview
Brought to you by YData
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 2825 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 286.9 KiB |
| Average record size in memory | 104.0 B |
Variable types
| Numeric | 12 |
|---|
avg_recency_days is highly overall correlated with frequency | High correlation |
avg_ticket is highly overall correlated with u_basket_size | High correlation |
basket_size is highly overall correlated with gross_revenue and 1 other fields | High correlation |
frequency is highly overall correlated with avg_recency_days | High correlation |
gross_revenue is highly overall correlated with basket_size and 3 other fields | High correlation |
qtde_invoices is highly overall correlated with gross_revenue and 2 other fields | High correlation |
qtde_itens is highly overall correlated with basket_size and 3 other fields | High correlation |
qtde_products is highly overall correlated with gross_revenue and 3 other fields | High correlation |
u_basket_size is highly overall correlated with avg_ticket and 1 other fields | High correlation |
avg_ticket is highly skewed (γ1 = 48.90445591) | Skewed |
frequency is highly skewed (γ1 = 22.8687173) | Skewed |
qtde_returns is highly skewed (γ1 = 50.55990893) | Skewed |
basket_size is highly skewed (γ1 = 45.1395063) | Skewed |
customer_id has unique values | Unique |
recency_days has 34 (1.2%) zeros | Zeros |
avg_recency_days has 51 (1.8%) zeros | Zeros |
qtde_returns has 1524 (53.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-10-31 13:11:10.082946 |
|---|---|
| Analysis finished | 2025-10-31 13:11:24.681617 |
| Duration | 14.6 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
customer_id
Real number (ℝ)
Unique
| Distinct | 2825 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15299.886 |
| Minimum | 12347 |
|---|---|
| Maximum | 18287 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 12347 |
|---|---|
| 5-th percentile | 12626.2 |
| Q1 | 13827 |
| median | 15271 |
| Q3 | 16801 |
| 95-th percentile | 17953.4 |
| Maximum | 18287 |
| Range | 5940 |
| Interquartile range (IQR) | 2974 |
Descriptive statistics
| Standard deviation | 1714.2532 |
|---|---|
| Coefficient of variation (CV) | 0.11204352 |
| Kurtosis | -1.2048524 |
| Mean | 15299.886 |
| Median Absolute Deviation (MAD) | 1484 |
| Skewness | 0.0021078867 |
| Sum | 43222179 |
| Variance | 2938664 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16000 | 1 | < 0.1% |
| 17850 | 1 | < 0.1% |
| 13047 | 1 | < 0.1% |
| 12583 | 1 | < 0.1% |
| 13748 | 1 | < 0.1% |
| 15100 | 1 | < 0.1% |
| 15291 | 1 | < 0.1% |
| 14688 | 1 | < 0.1% |
| 17809 | 1 | < 0.1% |
| 17502 | 1 | < 0.1% |
| Other values (2815) | 2815 |
| Value | Count | Frequency (%) |
| 12347 | 1 | |
| 12348 | 1 | |
| 12352 | 1 | |
| 12356 | 1 | |
| 12358 | 1 | |
| 12359 | 1 | |
| 12360 | 1 | |
| 12362 | 1 | |
| 12364 | 1 | |
| 12370 | 1 |
| Value | Count | Frequency (%) |
| 18287 | 1 | |
| 18283 | 1 | |
| 18282 | 1 | |
| 18273 | 1 | |
| 18272 | 1 | |
| 18270 | 1 | |
| 18265 | 1 | |
| 18263 | 1 | |
| 18261 | 1 | |
| 18260 | 1 |
gross_revenue
Real number (ℝ)
High correlation
| Distinct | 2809 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2879.4906 |
| Minimum | 36.56 |
|---|---|
| Maximum | 279138.02 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 36.56 |
|---|---|
| 5-th percentile | 251.878 |
| Q1 | 614.66 |
| median | 1142.99 |
| Q3 | 2389.1 |
| 95-th percentile | 7538.114 |
| Maximum | 279138.02 |
| Range | 279101.46 |
| Interquartile range (IQR) | 1774.44 |
Descriptive statistics
| Standard deviation | 10856.745 |
|---|---|
| Coefficient of variation (CV) | 3.77037 |
| Kurtosis | 334.71234 |
| Mean | 2879.4906 |
| Median Absolute Deviation (MAD) | 683.75 |
| Skewness | 16.301075 |
| Sum | 8134560.9 |
| Variance | 1.1786891 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 598.2 | 2 | 0.1% |
| 2053.02 | 2 | 0.1% |
| 1078.96 | 2 | 0.1% |
| 331 | 2 | 0.1% |
| 533.33 | 2 | 0.1% |
| 734.94 | 2 | 0.1% |
| 1314.45 | 2 | 0.1% |
| 379.65 | 2 | 0.1% |
| 178.96 | 2 | 0.1% |
| 889.93 | 2 | 0.1% |
| Other values (2799) | 2805 |
| Value | Count | Frequency (%) |
| 36.56 | 1 | |
| 52 | 1 | |
| 52.2 | 1 | |
| 62.43 | 1 | |
| 68.84 | 1 | |
| 70.02 | 1 | |
| 77.4 | 1 | |
| 84.65 | 1 | |
| 90.3 | 1 | |
| 93.35 | 1 |
| Value | Count | Frequency (%) |
| 279138.02 | 1 | |
| 259657.3 | 1 | |
| 194550.79 | 1 | |
| 168472.5 | 1 | |
| 140450.72 | 1 | |
| 124564.53 | 1 | |
| 117379.63 | 1 | |
| 91062.38 | 1 | |
| 72882.09 | 1 | |
| 66653.56 | 1 |
recency_days
Real number (ℝ)
Zeros
| Distinct | 257 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58.084956 |
| Minimum | 0 |
|---|---|
| Maximum | 373 |
| Zeros | 34 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 10 |
| median | 29 |
| Q3 | 74 |
| 95-th percentile | 215 |
| Maximum | 373 |
| Range | 373 |
| Interquartile range (IQR) | 64 |
Descriptive statistics
| Standard deviation | 70.211083 |
|---|---|
| Coefficient of variation (CV) | 1.2087654 |
| Kurtosis | 3.2630081 |
| Mean | 58.084956 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | 1.8725457 |
| Sum | 164090 |
| Variance | 4929.5962 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 99 | 3.5% |
| 4 | 87 | 3.1% |
| 2 | 86 | 3.0% |
| 3 | 85 | 3.0% |
| 8 | 76 | 2.7% |
| 10 | 69 | 2.4% |
| 9 | 66 | 2.3% |
| 7 | 65 | 2.3% |
| 17 | 62 | 2.2% |
| 22 | 56 | 2.0% |
| Other values (247) | 2074 |
| Value | Count | Frequency (%) |
| 0 | 34 | 1.2% |
| 1 | 99 | |
| 2 | 86 | |
| 3 | 85 | |
| 4 | 87 | |
| 5 | 43 | |
| 7 | 65 | |
| 8 | 76 | |
| 9 | 66 | |
| 10 | 69 |
| Value | Count | Frequency (%) |
| 373 | 1 | < 0.1% |
| 372 | 1 | < 0.1% |
| 369 | 1 | < 0.1% |
| 366 | 1 | < 0.1% |
| 360 | 1 | < 0.1% |
| 358 | 3 | |
| 354 | 1 | < 0.1% |
| 337 | 1 | < 0.1% |
| 336 | 2 | |
| 334 | 1 | < 0.1% |
qtde_invoices
Real number (ℝ)
High correlation
| Distinct | 55 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9823009 |
| Minimum | 2 |
|---|---|
| Maximum | 206 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 17 |
| Maximum | 206 |
| Range | 204 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 9.0045848 |
|---|---|
| Coefficient of variation (CV) | 1.5052043 |
| Kurtosis | 186.42448 |
| Mean | 5.9823009 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 10.689372 |
| Sum | 16900 |
| Variance | 81.082548 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 826 | |
| 3 | 503 | |
| 4 | 394 | |
| 5 | 237 | 8.4% |
| 6 | 173 | 6.1% |
| 7 | 138 | 4.9% |
| 8 | 98 | 3.5% |
| 9 | 69 | 2.4% |
| 10 | 55 | 1.9% |
| 11 | 54 | 1.9% |
| Other values (45) | 278 | 9.8% |
| Value | Count | Frequency (%) |
| 2 | 826 | |
| 3 | 503 | |
| 4 | 394 | |
| 5 | 237 | 8.4% |
| 6 | 173 | 6.1% |
| 7 | 138 | 4.9% |
| 8 | 98 | 3.5% |
| 9 | 69 | 2.4% |
| 10 | 55 | 1.9% |
| 11 | 54 | 1.9% |
| Value | Count | Frequency (%) |
| 206 | 1 | |
| 199 | 1 | |
| 124 | 1 | |
| 97 | 1 | |
| 91 | 2 | |
| 86 | 1 | |
| 72 | 1 | |
| 62 | 2 | |
| 60 | 1 | |
| 57 | 1 |
qtde_itens
Real number (ℝ)
High correlation
| Distinct | 1654 |
|---|---|
| Distinct (%) | 58.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1676.8634 |
| Minimum | 2 |
|---|---|
| Maximum | 196844 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 114.2 |
| Q1 | 323 |
| median | 684 |
| Q3 | 1469 |
| 95-th percentile | 4588.4 |
| Maximum | 196844 |
| Range | 196842 |
| Interquartile range (IQR) | 1146 |
Descriptive statistics
| Standard deviation | 6027.3154 |
|---|---|
| Coefficient of variation (CV) | 3.5943987 |
| Kurtosis | 445.02325 |
| Mean | 1676.8634 |
| Median Absolute Deviation (MAD) | 442 |
| Skewness | 17.461199 |
| Sum | 4737139 |
| Variance | 36328531 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 310 | 11 | 0.4% |
| 150 | 8 | 0.3% |
| 246 | 8 | 0.3% |
| 1200 | 7 | 0.2% |
| 493 | 7 | 0.2% |
| 272 | 7 | 0.2% |
| 219 | 7 | 0.2% |
| 394 | 7 | 0.2% |
| 200 | 7 | 0.2% |
| 516 | 7 | 0.2% |
| Other values (1644) | 2749 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 16 | 1 | |
| 17 | 1 | |
| 19 | 1 | |
| 20 | 1 | |
| 24 | 1 | |
| 25 | 1 | |
| 27 | 2 | |
| 30 | 1 | |
| 32 | 1 |
| Value | Count | Frequency (%) |
| 196844 | 1 | |
| 80997 | 1 | |
| 80263 | 1 | |
| 77373 | 1 | |
| 69993 | 1 | |
| 64549 | 1 | |
| 64124 | 1 | |
| 63312 | 1 | |
| 58343 | 1 | |
| 57885 | 1 |
qtde_products
Real number (ℝ)
High correlation
| Distinct | 467 |
|---|---|
| Distinct (%) | 16.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 128.19044 |
| Minimum | 2 |
|---|---|
| Maximum | 7838 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 34 |
| median | 71 |
| Q3 | 142 |
| 95-th percentile | 393.6 |
| Maximum | 7838 |
| Range | 7836 |
| Interquartile range (IQR) | 108 |
Descriptive statistics
| Standard deviation | 275.55026 |
|---|---|
| Coefficient of variation (CV) | 2.1495382 |
| Kurtosis | 341.98106 |
| Mean | 128.19044 |
| Median Absolute Deviation (MAD) | 44 |
| Skewness | 15.457136 |
| Sum | 362138 |
| Variance | 75927.945 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28 | 39 | 1.4% |
| 35 | 36 | 1.3% |
| 27 | 31 | 1.1% |
| 26 | 31 | 1.1% |
| 29 | 30 | 1.1% |
| 31 | 28 | 1.0% |
| 25 | 28 | 1.0% |
| 15 | 28 | 1.0% |
| 19 | 27 | 1.0% |
| 33 | 26 | 0.9% |
| Other values (457) | 2521 |
| Value | Count | Frequency (%) |
| 2 | 11 | |
| 3 | 14 | |
| 4 | 17 | |
| 5 | 16 | |
| 6 | 26 | |
| 7 | 15 | |
| 8 | 14 | |
| 9 | 20 | |
| 10 | 19 | |
| 11 | 23 |
| Value | Count | Frequency (%) |
| 7838 | 1 | |
| 5673 | 1 | |
| 5095 | 1 | |
| 4580 | 1 | |
| 2698 | 1 | |
| 2379 | 1 | |
| 2060 | 1 | |
| 1818 | 1 | |
| 1673 | 1 | |
| 1637 | 1 |
avg_ticket
Real number (ℝ)
High correlation Skewed
| Distinct | 1945 |
|---|---|
| Distinct (%) | 68.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56.841717 |
| Minimum | 2.15 |
|---|---|
| Maximum | 56157.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 2.15 |
|---|---|
| 5-th percentile | 4.762 |
| Q1 | 12.22 |
| median | 17.89 |
| Q3 | 24.98 |
| 95-th percentile | 88.1 |
| Maximum | 56157.5 |
| Range | 56155.35 |
| Interquartile range (IQR) | 12.76 |
Descriptive statistics
| Standard deviation | 1090.5299 |
|---|---|
| Coefficient of variation (CV) | 19.185379 |
| Kurtosis | 2490.1285 |
| Mean | 56.841717 |
| Median Absolute Deviation (MAD) | 6.41 |
| Skewness | 48.904456 |
| Sum | 160577.85 |
| Variance | 1189255.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15.49 | 7 | 0.2% |
| 17.66 | 6 | 0.2% |
| 16.82 | 6 | 0.2% |
| 16.53 | 6 | 0.2% |
| 19.06 | 6 | 0.2% |
| 16.92 | 5 | 0.2% |
| 17.71 | 5 | 0.2% |
| 17.13 | 5 | 0.2% |
| 19.44 | 5 | 0.2% |
| 10 | 5 | 0.2% |
| Other values (1935) | 2769 |
| Value | Count | Frequency (%) |
| 2.15 | 1 | |
| 2.43 | 1 | |
| 2.46 | 1 | |
| 2.51 | 1 | |
| 2.52 | 1 | |
| 2.65 | 1 | |
| 2.66 | 1 | |
| 2.71 | 1 | |
| 2.76 | 1 | |
| 2.77 | 1 |
| Value | Count | Frequency (%) |
| 56157.5 | 1 | |
| 13305.5 | 1 | |
| 4453.43 | 1 | |
| 1687.2 | 1 | |
| 1377.08 | 1 | |
| 952.99 | 1 | |
| 872.13 | 1 | |
| 841.02 | 1 | |
| 651.17 | 1 | |
| 640 | 1 |
avg_recency_days
Real number (ℝ)
High correlation Zeros
| Distinct | 1218 |
|---|---|
| Distinct (%) | 43.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 73.125451 |
| Minimum | 0 |
|---|---|
| Maximum | 366 |
| Zeros | 51 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 9.5568627 |
| Q1 | 30 |
| median | 54 |
| Q3 | 92.666667 |
| 95-th percentile | 212.6 |
| Maximum | 366 |
| Range | 366 |
| Interquartile range (IQR) | 62.666667 |
Descriptive statistics
| Standard deviation | 65.559622 |
|---|---|
| Coefficient of variation (CV) | 0.89653631 |
| Kurtosis | 4.1293141 |
| Mean | 73.125451 |
| Median Absolute Deviation (MAD) | 28.714286 |
| Skewness | 1.9103653 |
| Sum | 206579.4 |
| Variance | 4298.0641 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 51 | 1.8% |
| 70 | 20 | 0.7% |
| 31 | 20 | 0.7% |
| 14 | 16 | 0.6% |
| 21 | 16 | 0.6% |
| 46 | 15 | 0.5% |
| 42 | 15 | 0.5% |
| 55 | 15 | 0.5% |
| 49 | 14 | 0.5% |
| 25 | 13 | 0.5% |
| Other values (1208) | 2630 |
| Value | Count | Frequency (%) |
| 0 | 51 | |
| 0.0303030303 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.3333333333 | 1 | < 0.1% |
| 0.8571428571 | 1 | < 0.1% |
| 1 | 8 | 0.3% |
| 1.5 | 1 | < 0.1% |
| 1.819512195 | 1 | < 0.1% |
| 1.878787879 | 1 | < 0.1% |
| 2 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 366 | 1 | < 0.1% |
| 365 | 1 | < 0.1% |
| 364 | 1 | < 0.1% |
| 363 | 1 | < 0.1% |
| 357 | 2 | |
| 356 | 1 | < 0.1% |
| 355 | 2 | |
| 352 | 1 | < 0.1% |
| 351 | 2 | |
| 350 | 3 |
frequency
Real number (ℝ)
High correlation Skewed
| Distinct | 1226 |
|---|---|
| Distinct (%) | 43.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.087031574 |
| Minimum | 0.0054495913 |
|---|---|
| Maximum | 17 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 0.0054495913 |
|---|---|
| 5-th percentile | 0.0088183697 |
| Q1 | 0.015873016 |
| median | 0.024793388 |
| Q3 | 0.043478261 |
| 95-th percentile | 0.14935245 |
| Maximum | 17 |
| Range | 16.99455 |
| Interquartile range (IQR) | 0.027605245 |
Descriptive statistics
| Standard deviation | 0.436269 |
|---|---|
| Coefficient of variation (CV) | 5.012767 |
| Kurtosis | 810.59065 |
| Mean | 0.087031574 |
| Median Absolute Deviation (MAD) | 0.011000285 |
| Skewness | 22.868717 |
| Sum | 245.8642 |
| Variance | 0.19033064 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 47 | 1.7% |
| 0.0625 | 18 | 0.6% |
| 0.02777777778 | 17 | 0.6% |
| 0.02380952381 | 16 | 0.6% |
| 0.09090909091 | 15 | 0.5% |
| 0.08333333333 | 15 | 0.5% |
| 0.02941176471 | 14 | 0.5% |
| 0.03448275862 | 14 | 0.5% |
| 0.02564102564 | 13 | 0.5% |
| 0.02127659574 | 13 | 0.5% |
| Other values (1216) | 2643 |
| Value | Count | Frequency (%) |
| 0.005449591281 | 1 | < 0.1% |
| 0.005464480874 | 1 | < 0.1% |
| 0.005479452055 | 1 | < 0.1% |
| 0.005494505495 | 1 | < 0.1% |
| 0.005586592179 | 2 | |
| 0.005602240896 | 1 | < 0.1% |
| 0.005617977528 | 2 | |
| 0.00566572238 | 1 | < 0.1% |
| 0.005681818182 | 2 | |
| 0.005698005698 | 3 |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 5 | 0.2% |
| 2 | 47 | |
| 1.142857143 | 1 | < 0.1% |
| 1 | 8 | 0.3% |
| 0.75 | 1 | < 0.1% |
| 0.6666666667 | 3 | 0.1% |
| 0.550802139 | 1 | < 0.1% |
| 0.5335120643 | 1 | < 0.1% |
qtde_returns
Real number (ℝ)
Skewed Zeros
| Distinct | 205 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 63.067611 |
| Minimum | 0 |
|---|---|
| Maximum | 80995 |
| Zeros | 1524 |
| Zeros (%) | 53.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 8 |
| 95-th percentile | 94.8 |
| Maximum | 80995 |
| Range | 80995 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 1550.2251 |
|---|---|
| Coefficient of variation (CV) | 24.580368 |
| Kurtosis | 2633.7647 |
| Mean | 63.067611 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 50.559909 |
| Sum | 178166 |
| Variance | 2403197.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1524 | |
| 1 | 131 | 4.6% |
| 2 | 118 | 4.2% |
| 3 | 82 | 2.9% |
| 4 | 72 | 2.5% |
| 6 | 63 | 2.2% |
| 5 | 56 | 2.0% |
| 12 | 46 | 1.6% |
| 8 | 39 | 1.4% |
| 7 | 38 | 1.3% |
| Other values (195) | 656 |
| Value | Count | Frequency (%) |
| 0 | 1524 | |
| 1 | 131 | 4.6% |
| 2 | 118 | 4.2% |
| 3 | 82 | 2.9% |
| 4 | 72 | 2.5% |
| 5 | 56 | 2.0% |
| 6 | 63 | 2.2% |
| 7 | 38 | 1.3% |
| 8 | 39 | 1.4% |
| 9 | 38 | 1.3% |
| Value | Count | Frequency (%) |
| 80995 | 1 | |
| 9014 | 1 | |
| 8004 | 1 | |
| 4427 | 1 | |
| 3768 | 1 | |
| 3332 | 1 | |
| 2878 | 1 | |
| 2022 | 1 | |
| 2012 | 1 | |
| 1776 | 1 |
basket_size
Real number (ℝ)
High correlation Skewed
| Distinct | 1950 |
|---|---|
| Distinct (%) | 69.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 244.69986 |
| Minimum | 1 |
|---|---|
| Maximum | 40498.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 44.044 |
| Q1 | 102 |
| median | 171.08 |
| Q3 | 277 |
| 95-th percentile | 586.9 |
| Maximum | 40498.5 |
| Range | 40497.5 |
| Interquartile range (IQR) | 175 |
Descriptive statistics
| Standard deviation | 801.56855 |
|---|---|
| Coefficient of variation (CV) | 3.2757214 |
| Kurtosis | 2255.2607 |
| Mean | 244.69986 |
| Median Absolute Deviation (MAD) | 81.42 |
| Skewness | 45.139506 |
| Sum | 691277.1 |
| Variance | 642512.15 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 11 | 0.4% |
| 86 | 9 | 0.3% |
| 75 | 8 | 0.3% |
| 60 | 8 | 0.3% |
| 208 | 7 | 0.2% |
| 73 | 7 | 0.2% |
| 197 | 7 | 0.2% |
| 136 | 7 | 0.2% |
| 105 | 7 | 0.2% |
| 82 | 7 | 0.2% |
| Other values (1940) | 2747 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 3.33 | 1 | |
| 5.33 | 1 | |
| 5.67 | 1 | |
| 6.14 | 1 | |
| 7.5 | 1 | |
| 9 | 1 | |
| 9.5 | 1 | |
| 11 | 1 | |
| 11.88 | 1 |
| Value | Count | Frequency (%) |
| 40498.5 | 1 | |
| 6009.33 | 1 | |
| 3868.65 | 1 | |
| 2880 | 1 | |
| 2733.94 | 1 | |
| 2518.77 | 1 | |
| 2160.33 | 1 | |
| 2082.23 | 1 | |
| 2000 | 1 | |
| 1903.5 | 1 |
u_basket_size
Real number (ℝ)
High correlation
| Distinct | 982 |
|---|---|
| Distinct (%) | 34.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.101989 |
| Minimum | 1 |
|---|---|
| Maximum | 299.71 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 44.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3.382 |
| Q1 | 10.09 |
| median | 17.25 |
| Q3 | 28 |
| 95-th percentile | 56.664 |
| Maximum | 299.71 |
| Range | 298.71 |
| Interquartile range (IQR) | 17.91 |
Descriptive statistics
| Standard deviation | 18.851079 |
|---|---|
| Coefficient of variation (CV) | 0.85291323 |
| Kurtosis | 23.858004 |
| Mean | 22.101989 |
| Median Absolute Deviation (MAD) | 8.25 |
| Skewness | 3.1328747 |
| Sum | 62438.12 |
| Variance | 355.36318 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 45 | 1.6% |
| 14 | 31 | 1.1% |
| 11 | 31 | 1.1% |
| 9 | 27 | 1.0% |
| 7.5 | 27 | 1.0% |
| 1 | 27 | 1.0% |
| 17.5 | 26 | 0.9% |
| 10.5 | 26 | 0.9% |
| 15.5 | 24 | 0.8% |
| 12 | 24 | 0.8% |
| Other values (972) | 2537 |
| Value | Count | Frequency (%) |
| 1 | 27 | |
| 1.2 | 1 | < 0.1% |
| 1.25 | 1 | < 0.1% |
| 1.33 | 2 | 0.1% |
| 1.5 | 8 | 0.3% |
| 1.57 | 2 | 0.1% |
| 1.67 | 4 | 0.1% |
| 1.83 | 1 | < 0.1% |
| 2 | 22 | |
| 2.05 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 299.71 | 1 | |
| 203.5 | 1 | |
| 145 | 1 | |
| 136.12 | 1 | |
| 135.5 | 1 | |
| 122 | 1 | |
| 118 | 1 | |
| 114 | 1 | |
| 110.33 | 1 | |
| 110 | 1 |
Interactions
Correlations
| avg_recency_days | avg_ticket | basket_size | customer_id | frequency | gross_revenue | qtde_invoices | qtde_itens | qtde_products | qtde_returns | recency_days | u_basket_size | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| avg_recency_days | 1.000 | -0.068 | -0.026 | -0.031 | -0.971 | -0.341 | -0.453 | -0.316 | -0.287 | -0.215 | 0.186 | 0.061 |
| avg_ticket | -0.068 | 1.000 | 0.203 | -0.146 | 0.062 | 0.284 | 0.099 | 0.203 | -0.375 | 0.197 | 0.031 | -0.623 |
| basket_size | -0.026 | 0.203 | 1.000 | -0.122 | 0.007 | 0.604 | 0.133 | 0.764 | 0.405 | 0.212 | -0.112 | 0.430 |
| customer_id | -0.031 | -0.146 | -0.122 | 1.000 | 0.027 | -0.094 | 0.003 | -0.085 | 0.007 | -0.062 | 0.014 | 0.006 |
| frequency | -0.971 | 0.062 | 0.007 | 0.027 | 1.000 | 0.213 | 0.272 | 0.198 | 0.170 | 0.151 | -0.095 | -0.071 |
| gross_revenue | -0.341 | 0.284 | 0.604 | -0.094 | 0.213 | 1.000 | 0.762 | 0.920 | 0.718 | 0.464 | -0.380 | 0.278 |
| qtde_invoices | -0.453 | 0.099 | 0.133 | 0.003 | 0.272 | 0.762 | 1.000 | 0.704 | 0.659 | 0.430 | -0.452 | 0.019 |
| qtde_itens | -0.316 | 0.203 | 0.764 | -0.085 | 0.198 | 0.920 | 0.704 | 1.000 | 0.706 | 0.426 | -0.373 | 0.310 |
| qtde_products | -0.287 | -0.375 | 0.405 | 0.007 | 0.170 | 0.718 | 0.659 | 0.706 | 1.000 | 0.326 | -0.397 | 0.723 |
| qtde_returns | -0.215 | 0.197 | 0.212 | -0.062 | 0.151 | 0.464 | 0.430 | 0.426 | 0.326 | 1.000 | -0.190 | 0.021 |
| recency_days | 0.186 | 0.031 | -0.112 | 0.014 | -0.095 | -0.380 | -0.452 | -0.373 | -0.397 | -0.190 | 1.000 | -0.110 |
| u_basket_size | 0.061 | -0.623 | 0.430 | 0.006 | -0.071 | 0.278 | 0.019 | 0.310 | 0.723 | 0.021 | -0.110 | 1.000 |
Missing values
Sample
| customer_id | gross_revenue | recency_days | qtde_invoices | qtde_itens | qtde_products | avg_ticket | avg_recency_days | frequency | qtde_returns | basket_size | u_basket_size | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 17850 | 5391.21 | 372.0 | 34.0 | 1733.0 | 297.0 | 18.15 | 0.030303 | 17.000000 | 40.0 | 50.97 | 8.74 |
| 1 | 13047 | 3232.59 | 56.0 | 9.0 | 1390.0 | 171.0 | 18.90 | 39.625000 | 0.028302 | 35.0 | 154.44 | 19.00 |
| 2 | 12583 | 6705.38 | 2.0 | 15.0 | 5028.0 | 232.0 | 28.90 | 26.500000 | 0.040323 | 50.0 | 335.20 | 15.47 |
| 3 | 13748 | 948.25 | 95.0 | 5.0 | 439.0 | 28.0 | 33.87 | 69.500000 | 0.017921 | 0.0 | 87.80 | 5.60 |
| 4 | 15100 | 876.00 | 333.0 | 3.0 | 80.0 | 3.0 | 292.00 | 20.000000 | 0.073171 | 22.0 | 26.67 | 1.00 |
| 5 | 15291 | 4623.30 | 25.0 | 14.0 | 2102.0 | 102.0 | 45.33 | 26.769231 | 0.040115 | 29.0 | 150.14 | 7.29 |
| 6 | 14688 | 5630.87 | 7.0 | 21.0 | 3621.0 | 327.0 | 17.22 | 18.300000 | 0.057221 | 399.0 | 172.43 | 15.57 |
| 7 | 17809 | 5411.91 | 16.0 | 12.0 | 2057.0 | 61.0 | 88.72 | 32.454545 | 0.033520 | 41.0 | 171.42 | 5.08 |
| 8 | 15311 | 60767.90 | 0.0 | 91.0 | 38194.0 | 2379.0 | 25.54 | 4.144444 | 0.243316 | 474.0 | 419.71 | 26.14 |
| 9 | 16098 | 2005.63 | 87.0 | 7.0 | 613.0 | 67.0 | 29.93 | 47.666667 | 0.024390 | 0.0 | 87.57 | 9.57 |
| customer_id | gross_revenue | recency_days | qtde_invoices | qtde_itens | qtde_products | avg_ticket | avg_recency_days | frequency | qtde_returns | basket_size | u_basket_size | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5638 | 17468 | 137.00 | 10.0 | 2.0 | 116.0 | 5.0 | 27.40 | 4.000000 | 0.400000 | 0.0 | 58.00 | 2.5 |
| 5649 | 13596 | 697.04 | 5.0 | 2.0 | 406.0 | 166.0 | 4.20 | 7.000000 | 0.250000 | 0.0 | 203.00 | 83.0 |
| 5655 | 14893 | 1237.85 | 9.0 | 2.0 | 799.0 | 73.0 | 16.96 | 2.000000 | 0.666667 | 0.0 | 399.50 | 36.5 |
| 5657 | 17852 | 114.34 | 11.0 | 2.0 | 53.0 | 24.0 | 4.76 | 0.000000 | 2.000000 | 0.0 | 26.50 | 12.0 |
| 5674 | 17772 | 182.77 | 10.0 | 2.0 | 58.0 | 53.0 | 3.45 | 0.000000 | 2.000000 | 0.0 | 29.00 | 26.5 |
| 5680 | 14126 | 706.13 | 7.0 | 3.0 | 508.0 | 15.0 | 47.08 | 1.500000 | 0.750000 | 50.0 | 169.33 | 5.0 |
| 5681 | 16479 | 300.83 | 10.0 | 2.0 | 102.0 | 35.0 | 8.60 | 0.000000 | 2.000000 | 0.0 | 51.00 | 17.5 |
| 5686 | 13521 | 1092.39 | 1.0 | 3.0 | 733.0 | 435.0 | 2.51 | 4.500000 | 0.300000 | 0.0 | 244.33 | 145.0 |
| 5696 | 15060 | 301.84 | 8.0 | 4.0 | 262.0 | 120.0 | 2.52 | 0.333333 | 2.000000 | 0.0 | 65.50 | 30.0 |
| 5766 | 16000 | 12393.70 | 2.0 | 3.0 | 5110.0 | 9.0 | 1377.08 | 0.000000 | 3.000000 | 0.0 | 1703.33 | 3.0 |